1. Overview


This technical note provides a brief description of the US Army Research
Laboratory (ARL) Arabic Dependency Treebank (AADT) version 1.0 for the
purpose of documenting its release. AADT was automatically derived from 4
existing Linguistic Data Consortium (LDC) resources: the latest versions of the
Arabic Treebank (ATB) parts 1, 2, and 3, as well as the Arabic Treebank Broadcast
News dataset (Maamouri et al.; LDC2010T13, LDC2011T09, LDC2010T08,
LDC2012T07). The original ATB contains over 2,000 news stories produced by a
handful of Arabic news services. Each story was annotated by the LDC, with every
token receiving its appropriate part-of-speech tag and morphological segmentation,
and every sentence being annotated with its constituent parse. For AADT, we
created dependency parses using the latest version of the conversion process briefly
described in Section 4.5 of the paper “A Cross-Task Flexible Transition Model for
Arabic Tokenization, Affix Detection, Affix Labeling, POS Tagging, and
Dependency Parsing” (Tratz, 2013). An earlier version of this dependency treebank
was also used in the paper “Resumptive Pronoun Detection for Modern Standard
Arabic to English MT” (Tratz et al., 2014). The LDC is one of the foremost sources of
annotated data used in computational linguistics, and, by releasing this dependency
treebank back to them for redistribution, we hope to facilitate Arabic natural
language processing research by the greater community.

The  remainder  of  this  technical  note  defines  the  dependency  tree  file  format 
(Section  2),  and  presents  the  part-of-speech  tag  (Section  3)  and  dependency  label 
(Section  4)  schemes  used  throughout  the  AADT. 

2.  File  Format 


The files are in an 11-column tab-separated format with one or more blank lines
between sentences. All files are UTF-8 encoded. An example is presented below.

1    2     3     4      5     6            7         8      9                  10
0/0  core  -1/0  ROOT   mDy   >amoDaY_1    امضى      AmDY   VB_PV              spend/accomplish/..
1/0  core  0/0   subj   rkb   rAkib_1      ركاب      rkAb   NOUN               riders/passengers
2/0  pref  -     -      -     -            ال        Al     DET                the
2/1  core  1/0   idafa  Tyr   TA}irap_1    طائر      TA}r   NOUN               aircraft/airplane
2/2  suff  -     -      -     -            ة         p      NSUFF_FEM_SG       [fem.sg.]
3/0  pref  -     -      -     -            ال        Al     DET                the
3/1  core  2/1   amod   sEd   saEuwdiy~_1  سعودي     sEwdy  ADJ                Saudi
4/0  core  0/0   obj    lyl   layolap_1    ليل       lyl    NOUN               night/evening/soiree
4/1  suff  -     -      -     -            ت         t      NSUFF_FEM_SG       [fem.sg.]
4/2  core  4/0   idafa  -     -            هم        hm     PRON_AG_3_MASC_PL  their
5/0  pref  -     -      -     -            ال        Al     DET                the
5/1  core  4/0   amod   mDy   mADiy_1      ماضي      mADy   ADJ                past/bygone
5/2  suff  -     -      -     -            ة         p      NSUFF_FEM_SG       [fem.sg.]
6/0  core  0/0   prep   fy    fiy_1        في        fy     PREP               in
7/0  core  6/0   pcomp  fndq  funoduq_1    فندق      fndq   NOUN               hotel

The values of the 11 columns are as follows:


1) Unique identifier for a particular word/affix. The first number indicates the
whitespace/punctuation-separated token it belongs to; the second number
indicates the morpheme within the token.

2) One of 3 values: “core” (the core part of a word), “pref” (prefix), or “suff”
(suffix). The term core was chosen to avoid linguistically loaded terms such
as stem or root. It is worth noting that clitics are split off from the remainder
of the word and are marked as cores to indicate their word-level status
within the conversion. Since clitics appear frequently in Arabic, it is not
unusual for a single token to have multiple “core” lines associated with it.
Although the definite determiner Al is considered a clitic, it is labeled as a
“pref” for the sake of convenience, since there is never any question as to
where it attaches in a dependency tree.

3)  Identifier  of  the  governing  word. 

4)  Label  of  the  dependency  on  the  edge. 

5) The root of the token. Although the ATB’s integrated format specifies the
SAMA (Maamouri et al., LDC2010L01) lemma identifiers for the words, it
does not provide the root that the lemma is derived from (most roots in
Arabic are sequences of 3 or 4 characters). Therefore, the values in this field
were populated automatically using a utility program that accesses the
SAMA database.

6)  The  lemma  identifier  in  the  SAMA  database  for  the  given  word. 

7)  Original  text. 

8) Transliterated text (transliterated using the popular Buckwalter
transliteration scheme).

9) Part-of-speech label.

10)  Gloss  (definition). 


11) Sparsely populated field used to indicate co-indexing for resumptive
pronouns/affixes.

Thus, taken together, fields 1 through 4 define the labeled dependency edges
between all the “core” elements of the sentence. The “pref” and “suff” morphemes
are implicitly linked with their adjacent cores, and many of their fields are left
empty (indicated by a hyphen).
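
As an illustration only, the following Python sketch reads the format described
above. It is not part of the AADT release: the names Morpheme, read_sentences,
labeled_edges, and tokens are hypothetical, the padding of short lines is an
assumption, and the sample rows are adapted from the example in this section
with the original-text field replaced by a hyphen for brevity.

```python
from collections import namedtuple

# Field names are hypothetical; the order follows the 11 columns listed above.
FIELDS = ["ident", "kind", "head", "label", "root", "lemma",
          "text", "translit", "pos", "gloss", "coindex"]
Morpheme = namedtuple("Morpheme", FIELDS)


def parse_id(value):
    """Split an identifier such as '2/1' into (token, morpheme) integers."""
    token, morpheme = value.split("/")
    return int(token), int(morpheme)


def read_sentences(lines):
    """Yield each sentence as a list of Morpheme records.

    Sentences are separated by one or more blank lines.
    """
    sentence = []
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            if sentence:
                yield sentence
                sentence = []
            continue
        cols = line.split("\t")
        # Assumption: pad absent trailing fields with the empty marker '-'.
        cols += ["-"] * (len(FIELDS) - len(cols))
        sentence.append(Morpheme(*cols[:len(FIELDS)]))
    if sentence:
        yield sentence


def labeled_edges(sentence):
    """Return (dependent, governor, label) triples for the 'core' lines."""
    return [(parse_id(m.ident), parse_id(m.head), m.label)
            for m in sentence if m.kind == "core"]


def tokens(sentence):
    """Rejoin morphemes into whitespace tokens using the transliteration."""
    grouped = {}
    for m in sentence:
        grouped.setdefault(parse_id(m.ident)[0], []).append(m.translit)
    return ["".join(grouped[key]) for key in sorted(grouped)]


SAMPLE = "\n".join([
    "1/0\tcore\t0/0\tsubj\trkb\trAkib_1\t-\trkAb\tNOUN\triders/passengers\t-",
    "2/0\tpref\t-\t-\t-\t-\t-\tAl\tDET\tthe\t-",
    "2/1\tcore\t1/0\tidafa\tTyr\tTA}irap_1\t-\tTA}r\tNOUN\taircraft/airplane\t-",
    "2/2\tsuff\t-\t-\t-\t-\t-\tp\tNSUFF_FEM_SG\t[fem.sg.]\t-",
])

sentence = next(read_sentences(SAMPLE.split("\n")))
print(tokens(sentence))         # ['rkAb', 'AlTA}rp']
print(labeled_edges(sentence))
```

Only the “core” lines contribute edges here; the “pref” and “suff” lines are
recovered through the shared token number in field 1.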

3.  Part-of-Speech  (POS)  Tag  Scheme 

The POS tag scheme is similar to the scheme used by the ATB but has a variety of
modifications. One important note is that any portion of the original POS label that
corresponds to an unwritten portion of a word is simply dropped. For example, if
the original label for the token was DET+NOUN+CASE_DEF_GEN but the final
kaSra short vowel diacritic (the typical indicator of genitive case) was not
written, the DET label would appear on one line with the Al definite determiner,
the NOUN label would appear on a line with the core noun text, and the
CASE_DEF_GEN portion would simply be dropped because it does not correspond
to a written morpheme. Most of the mappings are 1-to-1 and should be fairly clear
to anyone who is already familiar with the ATB POS tag scheme; for example,
PVSUFF_SUBJ:2FS is rewritten as PS_2_FEM_SG. A list of all the POS tags is
provided in Appendix A.
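
The dropping rule can be pictured with a minimal Python sketch. This is not the
conversion code used to build AADT; the function name written_labels and the
list of written/unwritten flags are hypothetical stand-ins for information that
the real conversion takes from the ATB segmentation.

```python
def written_labels(atb_label, written):
    """Keep only the POS components whose morpheme was actually written.

    atb_label -- '+'-joined ATB label, e.g. 'DET+NOUN+CASE_DEF_GEN'
    written   -- one boolean per component (hypothetical input)
    """
    parts = atb_label.split("+")
    if len(parts) != len(written):
        raise ValueError("need one written/unwritten flag per label component")
    return [part for part, keep in zip(parts, written) if keep]


# The genitive case vowel was not written, so its label is dropped.
print(written_labels("DET+NOUN+CASE_DEF_GEN", [True, True, False]))
# ['DET', 'NOUN']
```

Each surviving component then appears on its own line of the file, as in the
DET and NOUN lines of the example in Section 2.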

4.  Dependency  Label  Scheme 

For ease of understanding, many of the dependency labels have names that are
similar or identical to the most similar dependency labels in the popular Stanford
English dependency label scheme (de Marneffe & Manning, 2008). However, this
is not to say that they may always be interpreted identically. Also, some labels
are specific to Arabic, including idafa, fidafa, and kccomp. A complete listing of the
AADT dependency labels is given in Appendix B.


Approved  for  public  release;  distribution  is  unlimited. 



5.  References 


Badawi El-Said, Carter Michael G, Gully Adrian. Modern written Arabic: A
comprehensive grammar. Routledge, 2004.

de Marneffe Marie-Catherine, Manning Christopher D. The Stanford typed
dependencies representation. COLING 2008: Proceedings of the Workshop on
Cross-Framework and Cross-Domain Parser Evaluation. Association for
Computational Linguistics, 2008.

Maamouri Mohamed, Bies Ann, Kulick Seth, Gaddeche Fatma, Mekki Wigdan,
Krouna Sondos, Bouziri Basma, Zaghouani Wajdi. Arabic treebank: Part 1
v 4.1 LDC2010T13. Web Download. Philadelphia: Linguistic Data Consortium,
2010.

Maamouri Mohamed, Bies Ann, Kulick Seth, Gaddeche Fatma, Mekki Wigdan,
Krouna Sondos, Bouziri Basma, Zaghouani Wajdi. Arabic treebank: Part 2
v 3.1 LDC2011T09. Web Download. Philadelphia: Linguistic Data Consortium,
2011.

Maamouri Mohamed, Bies Ann, Kulick Seth, Krouna Sondos, Gaddeche Fatma,
Zaghouani Wajdi. Arabic treebank: Part 3 v 3.2 LDC2010T08. Web
Download. Philadelphia: Linguistic Data Consortium, 2010.

Maamouri Mohamed, Bies Ann, Kulick Seth, Krouna Sondos, Tabassi Dalila, Ciul
Michael. Arabic treebank - Broadcast News v1.0 LDC2012T07. Web
Download. Philadelphia: Linguistic Data Consortium, 2012.

Maamouri Mohamed, Graff David, Bouziri Basma, Krouna Sondos, Bies Ann,
Kulick Seth. LDC Standard Arabic Morphological Analyzer (SAMA) version
3.1 LDC2010L01. Web Download. Philadelphia: Linguistic Data Consortium,
2010.

Ryding Karin C. A reference grammar of Modern Standard Arabic. Cambridge
University Press, 2005.

Tratz Stephen. A cross-task flexible transition model for Arabic tokenization, affix
detection, affix labeling, POS tagging, and dependency parsing. Proceedings
of the 4th Workshop on Statistical Parsing of Morphologically Rich
Languages. 2013.

Tratz Stephen, Voss Clare, Laoudi Jamal. Resumptive pronoun detection for
Modern Standard Arabic to English MT. Proceedings of the 3rd Workshop on
Hybrid Approaches to Translation (HyTra)@ EACL. 2014.




Appendix  A.  Part-of-Speech  Tag  List 




The  following  is  a  complete  list  of  the  part-of-speech  tags  that  appear  within  the 
AADT.  For  convenience,  the  list  is  split  into  3  sublists  based  upon  whether  the  tag 
is  used  with  elements  labeled  as  “core,”  “pref,”  or  “suff,”  respectively. 


A-1 Part-of-Speech Labels for 'core'

ABBREV                    Abbreviation
ADJ                       Adjective
ADJ_COMP                  Comparative or superlative adjective
ADJ_NUM                   Ordinal number
ADV                       Adverb
CONJ                      Coordinating conjunction
CONNEC_PART               Connective particle
#DEM_PRON - Demonstrative pronoun
DEM_PRON                  (Non-specific gender and number)
DEM_PRON_FEM              Feminine
DEM_PRON_FEM_DU           Feminine dual
DEM_PRON_FEM_SG           Feminine singular
DEM_PRON_MASC_DU          Masculine dual
DEM_PRON_MASC_SG          Masculine singular
DEM_PRON_PL               Plural
DIALECT                   Dialect
EMPHATIC_PART             Emphatic particle
EXCLAM_PRON               Exclamatory pronoun
FOCUS_PART                Focus particle
FOREIGN                   Foreign transliterated word
FUT_PART                  Future particle
INNA                      Inna or one of her sisters
INTERJ                    Interjection
INTERROG_PART             Interrogative particle
JUS_PART                  Jussive particle
LATIN                     Latin script token
NEG_PART                  Negative particle
NOUN                      Noun
NOUN_NUM                  Cardinal number
NOUN_PROP                 Proper noun
NOUN_QUANT                Quantifier noun
NUMERIC_COMMA             Numeric comma (letter reh used as comma)
PART                      Particle
PARTIAL                   Partial
PREP                      Preposition (true prepositions only)
#PRON - Pronoun
PRON_AG_1_PL              Accusative/genitive 1st person plural
PRON_AG_1_SG              Accusative/genitive 1st person singular
PRON_AG_2_DU              Accusative/genitive 2nd person dual
PRON_AG_2_FEM_PL          Accusative/genitive 2nd person feminine plural
PRON_AG_2_FEM_SG          Accusative/genitive 2nd person feminine singular
PRON_AG_2_MASC_PL         Accusative/genitive 2nd person masculine plural
PRON_AG_2_MASC_SG         Accusative/genitive 2nd person masculine singular
PRON_AG_3_DU              Accusative/genitive 3rd person dual
PRON_AG_3_FEM_PL          Accusative/genitive 3rd person feminine plural
PRON_AG_3_FEM_SG          Accusative/genitive 3rd person feminine singular
PRON_AG_3_MASC_PL         Accusative/genitive 3rd person masculine plural
PRON_AG_3_MASC_SG         Accusative/genitive 3rd person masculine singular
PRON_NOM_1_PL             Nominative 1st person plural
PRON_NOM_1_SG             Nominative 1st person singular
PRON_NOM_2_FEM_SG         Nominative 2nd person feminine singular
PRON_NOM_2_MASC_PL        Nominative 2nd person masculine plural
PRON_NOM_2_MASC_SG        Nominative 2nd person masculine singular
PRON_NOM_3_DU             Nominative 3rd person dual
PRON_NOM_3_FEM_PL         Nominative 3rd person feminine plural
PRON_NOM_3_FEM_SG         Nominative 3rd person feminine singular
PRON_NOM_3_MASC_PL        Nominative 3rd person masculine plural
PRON_NOM_3_MASC_SG        Nominative 3rd person masculine singular
PUNC                      Punctuation
RC_PART                   Result clause particle (introduces apodosis)
RESTRIC_PART              Restrictive particle
RI_ADV                    Relative or interrogative adverbial
#RI_PRON - Relative or interrogative pronoun
RI_PRON_DEF               Definite
RI_PRON_FEM_DU_AG_DEF     Feminine dual accusative/genitive definite
RI_PRON_FEM_DU_NOM_DEF    Feminine dual nominative definite
RI_PRON_FEM_PL_DEF        Feminine plural definite
RI_PRON_FEM_SG_DEF        Feminine singular definite
RI_PRON_INDEF             Indefinite
RI_PRON_MASC_DU_AG_DEF    Masculine dual accusative/genitive definite
RI_PRON_MASC_DU_NOM_DEF   Masculine dual nominative definite
RI_PRON_MASC_PL_DEF       Masculine plural definite
RI_PRON_MASC_SG_DEF       Masculine singular definite
SUB_CONJ                  Subordinating conjunction
TRANSERR                  Transcription error
TYPO                      Typo
VB_CV                     Command verb
VB_IV                     Imperfect verb
VB_IV_PASS                Imperfect verb passive voice
VB_PV                     Perfect verb
VB_PV_PASS                Perfect verb passive voice
VERB_PART                 Verb particle (laqado and qado)
VOC_PART                  Vocative particle


A-2 Part-of-Speech Labels for 'pref'

DET                       Determiner (NOTE: On occasion, the Al determiner
                          will appear unattached; in this relatively rare
                          situation, the determiner is considered a 'core'.)
#IP - Imperfect verb prefix
IP_1_PL                   1st person plural
IP_1_SG                   1st person singular
IP_2_DU                   2nd person dual
IP_2_FEM_PL               2nd person feminine plural
IP_2_FEM_SG               2nd person feminine singular
IP_2_MASC_PL              2nd person masculine plural
IP_2_MASC_SG              2nd person masculine singular
IP_3_FEM_DU               3rd person feminine dual
IP_3_FEM_PL               3rd person feminine plural
IP_3_FEM_SG               3rd person feminine singular
IP_3_MASC_DU              3rd person masculine dual
IP_3_MASC_PL              3rd person masculine plural
IP_3_MASC_SG              3rd person masculine singular

A-3 Part-of-Speech Labels for 'suff'

#CASE - Case marker
CASE_DEF_ACC              Definite accusative
CASE_DEF_GEN              Definite genitive
CASE_DEF_NOM              Definite nominative
CASE_INDEF_ACC            Indefinite accusative
CASE_INDEF_GEN            Indefinite genitive
CASE_INDEF_NOM            Indefinite nominative
#CS - Command verb suffix
CS_2_DU                   2nd person dual
CS_2_FEM_SG               2nd person feminine singular
CS_2_MASC_PL              2nd person masculine plural
CS_2_MASC_SG              2nd person masculine singular
#IS - Imperfect verb suffix
IS_2_FEM_SG_i             2nd person feminine singular indicative
IS_2_FEM_SG_s_j           2nd person feminine singular subjunctive/jussive
IS_DU_i                   Dual indicative
IS_DU_s_j                 Dual subjunctive/jussive
IS_FEM_PL                 Feminine plural
IS_MASC_PL_i              Masculine plural indicative
IS_MASC_PL_s_j            Masculine plural subjunctive/jussive
IS_i                      Indicative
IS_j                      Jussive
IS_s                      Subjunctive
#NS - Nominal suffix
NS_FEM_DU_AG              Feminine dual accusative/genitive
NS_FEM_DU_AG_POSS         Feminine dual accusative/genitive possessed
NS_FEM_DU_NOM             Feminine dual nominative
NS_FEM_DU_NOM_POSS        Feminine dual nominative possessed
NS_FEM_PL                 Feminine plural
NS_FEM_SG                 Feminine singular
NS_MASC_DU_AG             Masculine dual accusative/genitive
NS_MASC_DU_AG_POSS        Masculine dual accusative/genitive possessed
NS_MASC_DU_NOM            Masculine dual nominative
NS_MASC_DU_NOM_POSS       Masculine dual nominative possessed
NS_MASC_PL_AG             Masculine plural accusative/genitive
NS_MASC_PL_AG_POSS        Masculine plural accusative/genitive possessed
NS_MASC_PL_NOM            Masculine plural nominative
NS_MASC_PL_NOM_POSS       Masculine plural nominative possessed
#PS - Perfect verb suffix
PS_1_PL                   1st person plural
PS_1_SG                   1st person singular
PS_2_FEM_SG               2nd person feminine singular
PS_2_MASC_PL              2nd person masculine plural
PS_2_MASC_SG              2nd person masculine singular
PS_3_FEM_DU               3rd person feminine dual
PS_3_FEM_PL               3rd person feminine plural
PS_3_FEM_SG               3rd person feminine singular
PS_3_MASC_DU              3rd person masculine dual
PS_3_MASC_PL              3rd person masculine plural
PS_3_MASC_SG              3rd person masculine singular
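
Because the tag names in this appendix are built from a small set of recurring
pieces, a tag can be unpacked into features mechanically. The Python sketch
below is a hedged illustration, not part of the AADT release: the table FEATURES
and the function tag_features are hypothetical, they assume each
underscore-separated piece carries the meaning given in the lists above, and
they ignore the mood letters i, s, and j as well as the POSS marker.

```python
# Hypothetical lookup table derived from the tag descriptions in Appendix A.
FEATURES = {
    "1": ("person", "1st"),
    "2": ("person", "2nd"),
    "3": ("person", "3rd"),
    "FEM": ("gender", "feminine"),
    "MASC": ("gender", "masculine"),
    "SG": ("number", "singular"),
    "DU": ("number", "dual"),
    "PL": ("number", "plural"),
    "AG": ("case", "accusative/genitive"),
    "NOM": ("case", "nominative"),
    "ACC": ("case", "accusative"),
    "GEN": ("case", "genitive"),
    "DEF": ("definiteness", "definite"),
    "INDEF": ("definiteness", "indefinite"),
}


def tag_features(tag):
    """Map a tag such as 'PS_2_FEM_SG' to a dict of the features it names."""
    features = {}
    for piece in tag.split("_"):
        if piece in FEATURES:
            name, value = FEATURES[piece]
            features[name] = value
    return features


print(tag_features("PS_2_FEM_SG"))
print(tag_features("PRON_AG_3_MASC_PL"))
print(tag_features("CASE_DEF_GEN"))
```

Tags without any of these pieces, such as NOUN or VB_PV, simply yield an empty
dictionary.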

Appendix B. Dependency Labels List

The following is a complete list of the dependency labels that appear within the
AADT.


adv  Adverbial  modifier;  includes  adverbs  as  well  as  other  words  used 
adverbially  including  various  preposition-like  nouns  and  other  words. 

advcl  Adverbial  clause  or  adverbial  clause-like  structure. 

amod  Adjectival modifier.

appos  Apposition.  (Ryding,  2005;  pp.  224-227) 

cc  Connects a coordinating conjunction with a preceding conjunct. Note that
lakinna, a sister of inna that translates as “however” or “but,” may occur with this
dependency despite being labeled as an INNA and not a CONJ.

ccomp  Clausal  complement. 

combo  Combination.  This  is  currently  only  used  with  a  handful  of  multi-word 
coordinating  conjunction  expressions. 

conj  Connects  a  conjunct  with  a  preceding  coordinating  conjunction  or 
conjunct. 

cop  Complement  of  copula. 

dep  Other  or  unknown  dependency. 

det  Determiner.

fidafa  “False”  idafa.  Unlike  typical  idafa  constructions,  which  are  headed  by 
nouns,  these  are  headed  by  adjectives.  (Ryding,  2005;  pp.  221-223) 

flat  Flat.  This  is  used  for  names  and  similar  phenomena  that  lack  syntactic 
structure,  or  at  least  any  syntactic  annotation. 

icc  Initial coordinating conjunction. Arabic sentences frequently begin with
a coordinating conjunction. For this and similar situations, icc is used instead of
connecting the head of the sentence to the coordinating conjunction via a conj
dependency.

idafa  Idafa construction. Note that in the AADT, the "Quotation or title
relationship" (cf. Ryding, 2005; p. 210) is treated as apposition rather than idafa.

intj  Interjection. 

iobj  Indirect object.

kccomp  Clausal complement of kAna. This is separated from the more general
ccomp dependency because the verb kAna, following another verb, can be used to
express continued or habitual action in the past (Ryding, 2005; pp. 446-447).

neg  Negation. 

obj  Direct  object. 

ocomp  Object  complement. 

parataxis  Parataxis. Used to connect two sentences that are written next
to each other but that are not connected by an explicit coordinating conjunction.

part  Particle modifier. Used with a variety of different particles, including the
future particle. Note that NEG_PART will typically appear with a neg dependency
and FOCUS_PART is treated as if it were a preposition.

pcomp  Object/complement  of  a  true  preposition. 

prep  Preposition  modifier.  Links  a  true  preposition  to  its  governor. 

punct  Punctuation. 

reladv  Relative  adverbial  modifier. 

ricomp  Complement  of  a  relative  or  interrogative  pronoun/adverb. 

sc  Subordinating conjunction. Used with subordinating conjunctions other
than “inna and her sisters.”

subj  Subject.  This  may  occur  without  a  verb,  as  with  equational  sentences. 

tmz  Tamyiiz.  (Ryding,  2005) 

tpc  Topicalized  element  (not  including  topicalized  subjects). 

voc  Vocative. 

xrrcl  Relative  clause  with  an  explicit  relativizer. 

zrrcl  Relative  clause  without  an  explicit  relativizer  (zero  relativizer). 
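
When working with the treebank it can be useful to check edge labels against
this list and to group dependents under their governors. The Python sketch
below is illustrative only and not part of the AADT release: the names
AADT_LABELS, unknown_labels, and dependents_by_governor are hypothetical, the
edges are taken from the example in Section 2, and ROOT is handled separately
because it appears in that example but not in the list above.

```python
from collections import defaultdict

# The labels listed in Appendix B.
AADT_LABELS = {
    "adv", "advcl", "amod", "appos", "cc", "ccomp", "combo", "conj", "cop",
    "dep", "det", "fidafa", "flat", "icc", "idafa", "intj", "iobj", "kccomp",
    "neg", "obj", "ocomp", "parataxis", "part", "pcomp", "prep", "punct",
    "reladv", "ricomp", "sc", "subj", "tmz", "tpc", "voc", "xrrcl", "zrrcl",
}


def unknown_labels(edges):
    """Return labels that are neither ROOT nor listed in Appendix B."""
    seen = {label for _, _, label in edges}
    return sorted(seen - AADT_LABELS - {"ROOT"})


def dependents_by_governor(edges):
    """Group (dependent, label) pairs under each governor identifier."""
    tree = defaultdict(list)
    for dependent, governor, label in edges:
        tree[governor].append((dependent, label))
    return dict(tree)


# (dependent, governor, label) triples from the Section 2 example.
EDGES = [
    ("0/0", "-1/0", "ROOT"), ("1/0", "0/0", "subj"), ("2/1", "1/0", "idafa"),
    ("3/1", "2/1", "amod"), ("4/0", "0/0", "obj"), ("6/0", "0/0", "prep"),
    ("7/0", "6/0", "pcomp"),
]

print(unknown_labels(EDGES))                  # []
print(dependents_by_governor(EDGES)["0/0"])
```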